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^ | I Abstract. This paper studies two kinds of simulation between cellular automata: 

simulations based on factor and simulations based on sub-automaton. We show 
that these two kinds of simulation behave in two opposite ways with respect to 
the complexity of attractors and factor subshifts. On the one hand, the factor 
simulation preserves the complexity of limits sets or column factors (the simulator 
CA must have a higher complexity than the simulated CA). On the other hand, 
we show that any CA is the sub-automaton of some CA with a simple limit set 
(NL-recognizable) and the sub-automaton of some CA with a simple column factor 
(finite type). As a corollary, we get intrinsically universal CA with simple limit 
sets or simple column factors. Hence we are able to 'hide' the simulation power 
, of any CA under simple dynamical indicators. 

> 

(N 

^ '. Introduction 

' Since the introduction of the model in the 40s, cellular automata have been 

studied both as dynamical systems and as a computational model. In both aspects, 
they can show very complex behaviors, be it through their topological dynamics 
|Kur03] or through their ability to compute |vN66t 101103] . As such, they constitute 
a good model to tackle one of the major question of natural computing: what kind 
of dynamical behavior allows to support computation, and reciprocally, what does 
the ability to compute imply on the dynamical behavior of a system? 

In this p aper, w e focus on asymptotic dynamics of cellular automata (notion 
of limit set |CPY89p . and on their unprecise observation (notion of column fac- 
tors |Kur97j ). Intuitively, the limit set is the set of configuration that can appear 
arbitrarily far in the evolution of the system, and the column factor is the set of 
sequences of states that a cell (or group of cells) can take in a valid orbit of the 
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system. These notions have been intensively studied in the literature as indicators 
of the dynamical complexity of cellular automata |Hur90t IKar94| IBM97} ICFG10] . 

Concerning limit sets of cellular automata, the initial (wrong) intuition was that 
a universal CA shou ld necessarily have a non-recursive limit set (such a statement 
appears m |CPY89j ). Later, a Turing-complete CA with a simple limit set was 



constructed in [GMM93J. This result gives a first hint concerning the absence of 
correlation between the complexity of the limit set and the ability to handle com- 
putations. However, the construction goes through a slow simulation of two-register 
machines where registers are encoded in unary. Therefore, this results is extrinsic 
to the model of cellular automata and shows only that any behavior of register 
machines can be embedded into a cellular automaton with a simple limit set. 

Here, we aim at exploring intrinsic versions of the same question: what can be 
the limit set complexity of a CA able to simulate any other CA? More precisely, 
we consider two flavors of simulations: one using factor (continuous projection) of 
a simulator CA onto the simulated CA, and the other using the local uniform in- 
jection (sub- automaton) of a simulated CA into a simulator CA. Such simulation 
relations were extensively studied in [DMOTlObj . and the second flavor is the one 
giving rise to the notion of intrinsic universality |O1102| lDMOT10bj. The main point 
of the present paper is that factor simulations preserve limit set complexity whereas 
sub-automaton simulations do not. We also show that the same phenomenon ap- 
pears for the complexity of column factors. Our main results are two embedding 
theorems showing that there exist an intrinsically universal CA with column factors 
of finite type and an intrinsically universal CA with an NL-recognizable limit set 
(that is, its limit language can be recognized by a non-deterministic Turing machine 
in logarithmic space). The second result solves an open problem of |The05] . 



1. Definitions 

Let us note N + = N \ {0}. If i,j G Z, we note [i, jj the interval of integers k 
such that i < k < j, {i,j{ = — 1], }i,j{ = p + 1, j — 1] and so on. Consider 
a fixed finite alphabet A. If x G A z and C Z, then we note xjjjj G A*~ %+1 

the pattern corresponding to the sequence of letters Xi . . . Xj (and similarly for other 
kinds of intervals). 

If U C A k for some k G N and i G Z, the cylinder [U]i will denote the set of 
configurations x G A z such that xp^+fcj G U. We also note [U] = [U] . If a G A, let 
^a 00 be the configuration x G A z such that x,i = a for any i G Z. 

A dynamical system is a pair (X, F) where X is a compact space and F is a 
continuous self-map of X. We can then study iterations F l for any time step t G N. 

Let M stand either for N or for Z. The shift map a is defined for any z G A M and 
any i G M by a{x)i = Xi + \. A subshift is a dynamical system (E, a), or simply E, that 
is a subset of A M which is closed under the usual Tychonoff topology and invariant 
by the shift map. Equivalently, it is the set {z e A M \ V[i, j] C M,z ¥J} <£ J 7 } of 
infinite words that avoid the finite patterns from some given family T . If this family 
can be taken finite, then E is called a subshift of finite type (SFT). If it can be taken 
among words of length k G N + , it has order k. The language £(E) of a subshift E 
is the set {^[ij] z G E and C M} of patterns appearing in the infinite words 

of E. If this language is regular, then we say that E is a sofic subshift. Equivalently 
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thanks to the Weiss theorem |Wei73j , it is obtained from an SFT by a letter-to- letter 
projection. 

A cellular automaton (CA) is a dynamical system (A 1 , F) which commutes 
with the shift, i.e. aF = Fa. Equivalently, thanks to the Curtis-Hedlund-Lyndon 
theorem, it is obtained by some local map f : A 2r+1 — > A for some radius r G N, i. e. 
for any x G A 1 and « G Z, F(x)i = /(xjj_ r) j +T .]). F admits a spreading state G A if 
it admits a local rule / : A 2r+1 — > A with r G N and f(u) = whenever there exists 
i G [0, 2r] such that it* = 0. 

An SFT £ is irreducible if for any two words w, t> G £(£), there exists a word w 
such that G £(£). CA can actually be applied in a very natural way to these 
subshifts: a partial CA will be a dynamical system over some irreducible SFT that 
commutes with the shift. It is known from |Fio00] that any injective partial CA is 
bijective and reversible (the inverse is also a partial CA). 

1.1. Simulations 

The central notion studied in this paper is that of simulation between dynamical 
systems and more specifically cellular automata. We will distinguish two families of 
simulation relations based on the following notions. 

Definition 1.1 (Factors and sub-systems). 

• A factor map between two dynamical systems (A, F) and (Y, G) is a contin- 
uous onto map $ : X — > Y such that $F = L7$. In that case, we say that 
G is a factor of F. 

• A sub-system of a dynamical system (X, F) is a dynamical system of the 
form (Y, G) where Y C X (Y is closed) and F(Y) C Y. 

Note that a factor map $ between two subshifts £ and T respect the Curtis- 
Hedlund-Lyndon theorem: there exist a radius r G N and a /oca/ ru/e : £2r+i(£) 
such that = </>(xjj_ ri j +r .]) for any x G £ and any i G Z. We say that the factor 

map between two subshifts is a coloring if the radius can be taken r = 0. 

If (A, F) and (Y, G) are cellular automata, we say that (A, F) simulates (Y, G) 
by factor if there is a factor map $ from (A, F) onto (Y, G) which is also a factor 
map from (X,ax) onto (Y, ay)- Besides, when (X,F) is a cellular automaton and 
Y is a full-shift included in A, then (Y, F) is a sub-automaton of (A, F). 

These two relations (factor and sub-system) are restrictive since they don't allow 
entropy to increase: a factor or a sub-system of a given system has always a lower 
entropy. As a consequence, they don't support universality (existence of a system 
able to simulate any other) since it is not difficult to build systems of arbitrarily large 
entropy. Hence, in the literature, other ingredients were introduced to obtain richer 
notions of simulation: for instance, in cellular automata, operations of space and 
time rescaling added to the notion of sub-automaton lead to a notion of simulation 
supporting universality jDMUTlOal IDMOTlOb) . 

In this paper, such kind of spatio-temporal transformations are not considered 
explicitly for two reasons: 

• our results about factor simulation (Section EJ) involve properties (complexity 
of the subshifts) which are invariant by space and time rescaling, hence the 
results still hold when considering rescaling as part of the simulation relation; 
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• our results about sub-system simulation (Section [3]) are of the form "any 
CA is the sub-automaton of a CA with some given property", which is ac- 
tually the most general we can get, and remain true when replacing "sub- 
automaton" by more general notions of sub-system (such as sub-system, or 
simulated system in the sense of [DMOTlOb] ). 

1.2. Complexity of limit sets and column factors 

Many different points of view have been adopted to study the complexity of CA. 
We here use symbolic dynamics, and consider the complexity, as subshifts, of the 
limit set on the one hand, or the column factors on the other hand, as representing 
the actual complexity of the CA. 

The limit set of the dynamical system is the nonempty closed subset Qp = 
P) t n F t (X). Its limit system is the maximal surjective subsystem, (fl F ,F). It 
basically represent the asymptotic dynamics of the system. 

With respect to CA limit sets, it is not difficult to see that the corresponding 
language is always corecursively enumerable (it is an effective subshift). However, 
there are known examples of non- recursive ones |Hur90] . Moreover, simple additional 
remarks |Taa07[ IKur03t INas95] give the following hierarchy: 
F injective =>■ F\q f injective =>• Qp is an SFT ^ F is stable =>■ Qp is sofic =>- . . . 

The column factor of a dynamical system (A z , F) upon interval [i, jj C Z is the 

setrf^ 1 =T§ Ji (A z ), where T ^ : A% ~* (A^) N . It is a factor subshift 

since can represent an observation of the system made by a 

measuring device with a finite precision (that cannot see cells which are far away). 
It can be seen that any factor subshift is essentially a factor of some column factor 
|Kur97j . 

In the case of CA, shift-invariance allows to consider only the central column 
factors T F ~ k,k ^ for radius k G N + . It is known |Gil88] that these CA column factors 
always have a context-sensitive language; they may actually be strictly context- 
sensitive. In |Kur97] , strong links with topological notions are stated: finite column 
factors is equivalent to equicontinuity, SFT column factors imply the shadowing 
property which in turn imply sofic column factors. 

2. Factor simulations 

The two hierarchies that we have just defined, based on subshift classifications, 
are then very robust to factor simulations, as we can see in this section. Taking the 
vocabulary of order theory, we say that a class C of systems is an ideal for factor sim- 
ulation if, whenever (X, F) is a factor of (Y, G), we have: (Y, G) G C =>- (X, F) G C. 

Proposition 2.1. If is a factor map from (X, F) onto (Y, G), then — $(^f)- 

Proof. For any t G N, $F*(X) = G\Y). First note that $(Dt G N F \ x )) is included 
in n teN = riteN G W Conversely, let y G f] t€N G\Y) and, for t G N, 
X t = $ _1 (y) R F t (X). Note that X t is closed (since $ and F are continuous, and 
X is compact) and nonempty (since $ is onto). By compactness, the intersection 
Cl tgN X t = $ _1 (y) nQp is not empty, i.e. y G $>(flp). We have proven that $(f£p) = 
Q G . m 
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Moreover, we can note that Qpn = Q F , since if n G N + , then F nt (X) = 
C\( n -i)t<j<nt F^(X). In the following we will no longer mention time and space 
rescaling, which does not essentially alter the results. 

Corollary 2.2. The class of stable systems is an ideal for factor simulation. 

Proof. If $ is a factor map from (X,F) onto (Y,G), and Qp — F t {X) for some 
t G N, then Q G = $(Q F ) = $F f (X) = G\X). m 

We can also derive the two following results. 

Proposition 2.3. The class of CA whose limit set (resp. factor subshifts) is sofic 
(resp. context-free, context-sensitive, recursive) is an ideal for factor simulation. 

Proof. It is known that each of the corresponding classes of subshifts is preserved 
by factor maps. Proposition 12.11 states that the limit set of the simulated CA is a 
factor of the limit set of the simulating CA. 

Moreover, note, thanks to the transitivity of the notion of factor, that a factor 
subshift of the simulated CA is also a factor subshift of the simulating CA. ■ 

The following slightly generalizes a result in |The05] . 

Proposition 2.4. The class of reversible partial CA is an ideal for coloring. 

Proof. Let $ be a factor map based on some radius-0 local rule : A — )■ B, from a 
partial CA (E, F) onto another one (T,G), where £ C A z and F C B z and there 
exists some partial CA F^ 1 : £ — > E such that FF~ l is the identity. By surjectivity 
of $, for any letter b G B, there exists a letter, denoted abusively G A such 

that = b. If represents the parallel application of to B z , we 

obtain that is the identity of V. We can define the map G^ 1 = in 

such a way that GG" 1 is the identity of V . G^ 1 is an injective partial CA, hence it 
is reversible, and it is the actual inverse map of G. m 

As far as we know though, it is unknown whether the class of injective CA is 
still an ideal for factor simulation. 

Corollary 2.5. The class of CA which are reversible over the limit set is an ideal 
for coloring. 

Proof. Consider such a CA. From |Taa07j . it is stable and its limit set is an ir- 
reducible SFT. Now if it is linked to some other CA by some coloring, then by 
Proposition 12. 1[ so are the two limit systems (that are partial CA), which then re- 
spect the hypotheses of Proposition 12.41 ■ 

From the previous, we can see that any universal CA for factor simulation (up 
to rescaling) must have a non-recursive limit set and strictly context-sensitive factor 
subshifts (and of course must not be injective), but the existence of such a CA is 
still open. 

3. Sub-system simulations 

We will see in this section that, contrary to factor simulations, sub-system sim- 
ulations allow to hide the complexity of the simulated CA into the simulator CA. 
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3.1. Hiding the column factors 

If G is a CA over alphabet C of radius r > 0, local rule g, then let us define the 
CA G over alphabet D — {— 1, 0, 1} x C with the same radius r as G and local rule: 

D 2r+1 _^ D 

Oo,c eo ) if £ ^0; 
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(0, g(c_ r . . . c r )) otherwise. 



Clearly, G is a sub-automaton of G (up to state renaming) corresponding to the 
sub- alphabet {0} x C. 

Theorem 3.1. Any cellular automaton G is a sub- automaton of some CA whose 
column factors are SFT of order 2. 

Proof. Let us take a CA G over alphabet C, of radius 1. Let G be defined as above 
over alphabet C = { — 1, 0, 1} x C, k G N + and E its column factor of width k. Of 
course, E is included in its 2 -approximation 



A 2 {T) = {z = (z') t6N G (C"T Vt G N, 3a;' G [z*] n G ([z + ]) 

Let us show that they are actually equal. Let z = (V) teN G (G fc ) N be such that 
for any t G N there exists some configuration x l G [-2*] with G{x t ) G and a; 

defined by: 



x® if < i < k 



r, (— 1,6) if z < and xZ % {~ 1 = [£,c) 
[1, c) if i > k and x]^ k = (e, c) . 
An inductive application of the local rule gives that for any t G N, we have: 



(-he) 
(he) 



if < i < k 

if z < and x^" 1 = (e, c) 
if i > k and x^ k = (e, c) 



In particular, z = T|(x) G E. We have proven that E is an SFT of order 2. If G 

does not have radius 1, then it is easy to widen the radius of G (and increase the 
speed of the shifts) to get the same result. ■ 



3.2. Hiding the limit set 

The main result of this section is based on the existence of a firing-squad CA 
with specific properties expressed by Lemma 13.21 We actually refer to the firing- 
squad CA defined in [Kar94] . that we denote by S, and prove additional properties 
in Section HJ This CA admits a so-called firing state 7 and a spreading state k. Let 
rs the radius of S, s its local rule, Q its state set, and Q' = Q \ {^,7}- Consider 
the set Xs of configurations having an infinite history avoiding k and 7: 

X s = {y G Q z \ 3(y t ) teN G (Q z f,y° = y and Vt G N + ,y' G Q' and Stf) = y 1 ' 1 } . 

Lemma 3.2. S satisfies the following: 

(1) 00 7 00 G X s ; 

(2) ^n[ 7 ]cK 7 } z ; 

(3) X s is NL-recognizable. 
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Proof. Properties ([T]) and (J2J) are proven in [Kar94j . Property ([3]) is given by Propo- 
sition 14.101 below. ■ 

This construction allows to state the following theorem, proven at the end of 
the subsection. 

Theorem 3.3. Any CA is a sub-automaton of some CA whose limit set is NL- 
recognizable. 

By the existence of intrinsically universal CA for a simulation containing the 
sub-automaton relation (see for instance |OU03j ) and the transitivity of simulations, 
we can directly derive the following. 

Corollary 3.4. There exists an intrinsically universal CA whose limit set is NL- 
recognizable. 

The idea of the construction is the following: given some CA F over alphabet A, 
we add an extra (firing-squad) component which is able to generate any configuration 
of A z arbitrarily far in the future. The complexity of the limit set of F is thus 
completely flooded into the full-shift A z . All the technical difficulty is to control the 
contribution of the additional component to the final limit set. 

Let F be a CA of radius ry, local rule / over alphabet A with a spreading state 
G A. We define a CA A F<S of local rule 5p,s defined on alphabet C = A U (A x Q) 
with radius r = max(ry, r s ) by: 

f(a_ rp . . . a r . F ) if c = a_ r . . . a r G A 2r+1 ; (1) 

a if c = (a_ r , 7 ) . . . (a r , 7 ) G (A x { 7 }) 2r+1 ; (2) 

(a , s(b- rs . . . b rs )) if c = (a_ P , 6_r) . . . (a r , b r ) e(Ax Q') 2r+1 ; (3) 

otherwise. (4) 

Basically, this CA freezes the first component while applying the firing squad on 
the second component until some firing state appears, which then frees this second 
component and starts the application of F. When the configuration is not coherent, 
or when k appears, begins to spread. Clearly, F is a sub-automaton of Ap,s- 

The structure of the corresponding limit set will be given by the following lem- 
mas. 

Lemma 3.5. A z C ^a fs - 

Proof. Let x G A z . From Point [1] of Lemma I3.2[ there is a sequence (y^teN with 
y0 _ 00^,00 an( j £ or an y ^ g yt ^ {7,/?} and S(y t ) = J/* -1 . Consider now the 
configurations x t = (xi, yt +1 )iez for t G N+. By a quick induction on t G N+, we can 
see that for any cell i G Z, only case (3) of the local rule is used, and x° = A l F s (x*). 
At time —1, since the second component of x~ l is °°7 00 , case (2) of the rule is applied 
in every cell, which gives x = A^s^a; -1 ) = A^ 5 (a; - ') for any t G N+. ■ 

Lemma 3.6. Let x G &a fs an d hj £ ^ such that i / j and Xi = (ai,j),Xj = 
(dj, bj) G A x Q. Then bj G {7, k}. 

Proof. We can assume, by symmetry, that i < j, and for the sake of contradic- 
tion that bj £ {7, k}. Let (x*) tG z be a biorbit of bisequence of 
configurations such that Vt G Ap^si^) — x t+1 . By an easy recurrence and the 
fact that Xi G A x Q can only be obtained through case (3) of the rule, we can 
see that for any t G N, x^_ rt4+rt} can be written (a^, b~\ t ) . . . (a^ rt , fe^JJ G 
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(A x Q) l+2rt and s*(fe i J rst . . . 6 iH ! rst ) = in the same way, Xy_ rt j+r q can be written 
(aj^rt. • • • («i+rt, W) e (A x and s*(6T^ st . . . bjl st ) = b r Then for 

any t > 3 -^, ^i- 2 H,j+2rt} is in ( A x Q) 3 ~ l ~ 1+4rt and the image s^x^^^j) 
contains hi and fy. In other words, the cylinder [biQ 3 ~ t ~ 1 bj]i intersects S t (Q z ) for 
any t, and by compactness intersects tts, which contradicts Point [2] of Lemma \3. 21 ■ 

If S C A z is a subshift and G A, then we consider the subshift • S • = 

U-oo<«<m<+oo { x e ^ Z | V * ^ ll,ml,Xi = and 3y G T,,x VM = y\i, m {} of configura- 
tions or pieces of configurations of £ surrounded by 0. 

Lemma 3.7. Q Afs \A z cO«(AxQ) z iO. 

Proof. By shift-invariance, it is sufficient to prove that Qa fs H [A x Q] c • (A x 
Q) z »0. Let us prove by induction on n G N that the patterns of (AxQ)(A 2rn \{0 2rn }) 
are forbidden in ^a fs - The base case is trivial (there are no such patterns). Now 
suppose it is true for n G N, and suppose there exists a configuration x G [(A x 
Q)0 2rn+k {A\{0})] nQ AFS with 1 < k < 2r. Consider a preimage y G Q Af , s of x. On 
the one hand, in cell of y, we must have applied case (3), so yj_ r!+r j G (A x Q) 2r+1 , 
and this word does not involve 7. On the other hand, if we have applied case 
(1) in cell 2nr + k + 1 of y, then y^n— i)r+k+i,(2n+i)r+k+ij 

G (A \ {0}) 2r+1 , but 

the space between these two neighborhoods is (2n — l)r + k + 1 — r — 1 < 2nr — 
1, which contradicts the induction hypothesis. The other possibility was that we 
have applied case (2) in cell 2nr + k + 1, which involves a state 7 among cells of 
y[(2n-i)r+Jfc+i,(2n+i)r+Jfc+i]) which contradicts Lemma In the limit, and with a 
symmetric argument on the left, we obtain that all the configurations of Qa fs \ A z 
are in • E • 0. ■ 

We shall abusively denote A z x X$ = { (a«, Sj)iez G {AxQf\{ Sl ) l& eX s }. 

Lemma 3.8. Vl Aps = A z U • (A z x X s ) • 0. 

Proof. Thanks to Lemmas 13.71 and 13.51 it is enough to prove two inclusions for the 
configurations x G C z with I, m G [—00, +00] such that xj/ )Tn [ G (A x Q)" 1 ^ 1 ^ 1 and 
for any i £ ]/, m[, Xi = 0. 

First, suppose that x G ^a fs , i-e. for any t G Z, there exists x* G A^ 5 ({x}). By 
recurrence, we can see that x~* 6ixQ' for all i G ]/ — rt, m + ri[ and t > 1 since 
states from Ax Q are only produced by case (3) of the rule. Let G Q' m - l + 2rt + l 
be the projection of (x~*)]z_ rtjTO+r . t | on its second component. Clearly, w° is in the 
language of Xs- We deduce that x = x° G • (A z x X 5 ) • 0. 

Conversely, suppose that x G • (A z x Xs) • 0, ie. there is a sequence (y*) 
with, for t > 1, G Q' Z and = S(y t+1 ) and, for any t G N and any z G 
x = (tti, S t (y t )i) for some <2j G A. Now take the configuration y l G C z such that 
for any i Jl — rt,m + rt\, y\ = 0, and for any % G ]/ — rt, m + rt{, y\ = (bi, y\) 
with bi G A, and foj = a$ if i G ]/,77i[. By a direct recurrence, for any j < t 
and any % ^ ]/ — + rj, m + rt — rj\, we have A FS (y 3 )i = and for any i G 
Jl — rt + rj, m + rt — rj\, we have A FS (y 3 )i = (bi, S 3 (y 3 )i) (since yj G Q' G Z case 
3 of the definition of A^s applies at position i of y- 7 ). This gives that A FS (y) = x. 
We have proven that J] A n • (A x Q) z • = • (A z x X s ) • 0. ■ 
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Figure 1: The 16-state firing squad of |Kar94j . Empty spaces represent the blank 
state. 



Corollary 3.9. ^a fs has an NL-recognizable language. 

Proof. From Lemma 13.81 and Point |3] of Lemma 13.21 the language of the limit set is 
the finite boolean combination of finite concatenation of NL-recognizable languages. 

■ 

Proof of Theorem \3.c\ Let F be a CA on some alphabet A. We can artificially add 
some spreading state ^ A to build a CA F on alphabet A U {0} which admits F 
as a sub-automaton. Now we have seen that F is a sub-automaton of Ap s - From 
Corollary I3.9[ the corresponding limit set has an NL-recognizable language. ■ 



4. Analysis of a firing-squad CA 

More precise proofs can be found in the appendix. 

Let S be the firing-squad CA defined in [Kar94j. It has a state set Q of size 
16, including a killer state k, radius 1 and is defined by the transitions appearing 
in Figure [TJ precisely, any transition which is not in the space-time diagram of the 
figure produces the killer state k. The complete list of transitions is given in |Kar94j . 

We are interested in history diagrams in X$, i.e. mapping from Z x N to Q of 
the form: (z,t) i— >■ x t (z) where (x*) is a sequence of configuration in X$ such that 
S(x t+1 ) = x l . We call them valid history diagrams. 

When restricted to Xs, the behavior of S is easier to understand via a sig- 
nal/collision evolving in a quiescent background. More precisely, the background is 
uniform and made of blank states (denoted B in the sequel) and the signals involved 
are: 
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signal 


In 


h 


k 


#' 


T 2 


n 


Hi 


speed 


-1 


-1 


-1/2 





1/2 


l 


l 



The valid collisions are: 

Li + R x -»• Zi + n k + r 2 ^# 

h + r 2 ^h+r 2 h + #' + n -> # 

n + Z 2 ->■ ri + Z 2 # -> Li + l 2 + #' + r 2 + Ri 

Any other intersection of signal is invalid (it raises a k state). Moreover, the last 
collision rule (starting from a single jf) is valid only if the # is distant from any 
other # by at least 3 cells: if they are 1 cell away, 3 adjacent j^' are generated; if 
they are 2 cells away, a k is generated 2 steps later. 

To simplify proofs, we will often make reasonings over (portions of) "Euclidean" 
versions of history diagrams. A Euclidean history diagram is a set of labelled points 
and labelled (half-) lines or segments in IR 2 satisfying the following rules: 

• points are only at integer coordinates (Z 2 ) and labelled by 

• (half-)lines and segments correspond to signals listed above (label and slope 
correspond) ; 

• any intersection between lines or segments follow the collision rules above. 

Lemma 4.1. To each history diagram D, we can associate a valid Euclidean history 
diagram E such that, at any integer coordinate of E containing a point (#) or a 
single signal, the label gives the state of the corresponding position in D. 

This lemma allows the following proof scheme (used several times below): sup- 
posing by sake of contradiction that some word w occurs in a history diagram, we 
make a reasoning on the corresponding Euclidean diagram, we get a contradiction 
and finally deduce that no history diagram exists which contain the word w, and 
therefore that w is not in the language of X$- 

S satisfies points (1) and (2) of Lemma [3.21 as shown in (Kar941 Prop. 4.3]. We 
give below a complete characterization of the language of X$ which shows that it is 
NL-recognizable . 

Lemma 4.2. Consider a history diagram containing a word w G at time to- 

Let z\ (resp. z 2 ) be the cell where the first (resp. the last) letter of w occurs. We 
suppose in addition that the left # of w was created by a l\ signal. Let t\ be the first 
time step in the past when the cell z\ is in state and t 2 be the first time step in 
the past when the cell z 2 is in state Then, both t\ and t 2 exist. 

We denote by E the set Q' \ {#, #'}. 

Lemma 4.3. There is no history diagram containing a word w of the form 
where the left # is created by T\jl\ signals. 

Lemma 4.4. There is no history diagram containing a word w of the form 
where the left was created by a l 2 /r 2 pair of signals. 

Lemma 4.5. Any configuration from X$ with at least two # is of the following 
form, for some value of n: "(#5")" 

Lemma 4.6. Let L be the language of configurations from X$ admitting an history 
diagram where two # occur at some time t in the past. Then L G NL - recognizable 
in logarithmic space. 
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Lemma 4.7. The language of configurations from X$ which contain only one state 
in {#? #'} is a l so ^ n NL - 

Lemma 4.8. Let L be the language of configurations from X$ with two or more 
and having a history diagram with no Then L is regular. 

Lemma 4.9. The language of the configurations from X$ without any # or $ is 
regular. 

Proposition 4.10. The language of Xs is in NL. 

Proof. There are several cases, and the disjunction on configurations allows to ex- 
press the language of X s as a union of 'simple' NL languages. 

(1) Configurations with #s or #s. We can descibe the set of these configurations 
by: 

u {Li, h, B}{1 2 , B}*A{r 2 , B}*{R 1: r u BY 
where A is one of the following (possibly infinite) configurations: 

(a) A has exactly one state in #'}. We conclude in this case with 
Lemma 14.71 

(b) A has one and at least one other # or . Lemmas I4.3[ 14.41 and 14.51 
show that the configuration satisfy the hypothesis of Lemma I4.6[ which 
allows to conclude. 

(c) A has at least two but do not contain any signal. Then Lemma 14.81 
conclude. 

(d) A has at least two along with some signal(s) between two Denote 
by c the global configuration in this case. We can simply go back a few 
steps in the past to find out a configuration d of case [Tbl Then we can 
apply Lemma 14.61 to c. 

(2) Configurations without #s nor #'s. This case is treated in Lemma 14.91 
Thus, since we have described above why each of the possible languages could be 
recognized in NL, we can just build a non-deterministic machine beginning by making 
a non-deterministic choice between all of these machines, then doing the computation 
of the chosen one. ■ 



Conclusion 

We have thus achieved results implying that both limit set and column factors 
complexities are strongly linked to the factor simulation hierarchy; on the other 
hand, they are rather orthogonal to the sub-automaton simulation hierarchy. 

Many open questions remain. 

• We have obtained that universality was not forbidden by some rather strong 
constraints either on the limit set, or "orthogonally", on the column factors. 
A natural question is whether we can constrain both at the same time. The 
two constructions may possibly be composed together, at the price of a (yet) 
more difficult proof of the NL-recognizability of the limit set. 

• Similarly, we believe that our results still hold when alphabets are restricted 
to {0, 1} but at the price of a more technical proof. 

• Is there an intrinsically universal CA with an SFT limit set? Following 
our construction, this raises immediately the following question: is there a 
firing-squad CA with an SFT limit set? 
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• What kind of limit system can an intrinsically universal CA have? Can it 
be injective? 

• Is injectivity, expansivity of CA preserved by factor maps? 

• Is it enough, for a CA to be a factor of another CA, that the corresponding 
column factors with some given width be linked by a factor map? 

• Is there, for some complexity level A, an equivalence class for the sub- 
automaton simulation (with space-time rescalings) of which all the elements 
have limit sets of complexity A? 
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Appendix A. Proof of Lemma 14.11 

Proof. There is no rounding problem for signals of slope —1, and 1 (positions of 
cells of discrete signals correspond exactly to integer position on the continuous sig- 
nal). Concerning slopes 1/2 and —1/2, S is such that when two # at position p\ and 
P2 are connected by a 1/2 signal in a discrete space-time diagram, the displacement 
vector p2 — pi is always of the form (n, 2n) (the situation is similar for slope —1/2). 
More precisely, one can check that the set of cells {pi + (i, 2i) : i G N, i < n} are in a 
r 2 signal state, and it corresponds exactly to the integer positions on the continuous 
segment from p\ to p 2 - ■ 



Appendix B. Proof of Lemma 14.21 

Proof. First let us prove that t\ exists. Let us assume that there is no # in the past 
of cell z<i- The only possible past of this cell, in this case, is necessarily an infinite 
column of #'s. But we assumed that the left # was created by an l\ signal, which 
necesarily either crossed this column, or was generated by it. In both cases, this is 
a contradiction. 

Now that we know this, we prove that t 2 exists. Let us assume that cell z\ has 
only #'s in its past. Since the # in the right column was necesarily created by a 
signal coming from the left, be it r± or r 2 ; this implies that the signal should have 
"crossed" the left column, which is impossible. Thus, there is at least one # in this 
column. ■ 



Appendix C. Proof of Lemma 14.31 

Proof. Let us call Z the l\ signal that created the left # of w at t . According to 
Lemma 14.21 there is some time step ti in which a # appears in the past of the right 
(i.e. of column C). Moreover, let t 2 be the most recent step in which a # appears 
in the past of the # on column B at to- There are two cases: 

• This # in column C was created by an r± signal Z\. In this case, this # is 
necessarily the same as the one that generated Z. Else, applying the same 
reasonning as in Lemma [4.21 there would be another # between Z, and the 
# at ti, and then Z\ would have crossed both Z and the L\ signal emitted 
by this intermediate 

We now have two subcases: 

— The # at time t 2 in column B was created by h/r2- We deduce that 
the # on column C at time t\ was created by an l\jr\ pair of signals: 
otherwise we would have a /2/^2 intersection between the two columns, 
which is only possible on a #, but this would create a column of #'s 
visible at time to, which contradicts the hypothesis. Thus we can deduce 
that: 

(1) Column C existed before ti, and we can thus consider the first # 
on it before t\. This is the # which emitted the Z 2 signal creating 
the # on column B at t 2 , otherwise a l\ or Z 2 signal emmitted 
by this # would intersect column B between to and t 2 , which is 
impossible by minimality of t 2 . 
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(2) the r 2 signal arriving on the # of column B at time t 2 and the T\ 
arriving on the # of column C at time t\ meet in the past on some 
This allows to infer the existence of a third column A on the 
left of the two previous ones; 
We thus have the following situation: 

A d 2 B 



2di 




The distances imposed by the signals allow to conclude: 
d± + d 2 = 2d 2 + d' — di 
2d\ = d 2 + d' 

do 



(C.l) 



(C.2) 



d 2 + di = 2d 1 + d' 
2d 2 = d% + d! 

Equation (IC.ip comes from the r\ signal from the leftmost bottom 
and equation (IC.2I) comes from the rightmost bottom one. From this 
we can deduce that d\ = d 2 = d! and that the K\jr\ signal emitted 
at time t 2 by the central column reaches the right column at time to, 
contradicting the fact that the state at time to i n this right column is 

It was created by /i/r 1; and so was the right # at t\. In this case, one of 
them has to have another # in its past, by Lemma T4.21 Thus, this new 
# sends a r 2 or l 2 signal (depending on which one we consider), that 
necessarily collides either with the other column of #'s, or to the Li/Ri 
signals represented in thick on the figure below. Since this L\ signal 
creates the top left this collision should occur necessarily before time 
to and would create a k which is forbidden by hypothesis. 
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• Or it was created by h/r2- In this case, the only possible option for the # 
at t 2 on the left column is a h/r\, else there would be another column 
between the # and the #' at time t , contrarily to our hypothesis. 

Therefore we are in the exact symmetric of the case studied above, where 
we supposed that the # at t 2 was created by h/r 2 , and the left # at t x by 
li/r\. We can conclude to a contradiction by the same reasoning. 



Appendix D. Proof of Lemma 14.4 

Proof. We want to use the above Lemma 14.31 to prove this one. To do this, we 
need to prove the existence of a configuration of the form ^E*^' or its symmetric 
with in any case the # created by l\jr\ signals, in the past of our current 
configuration. 

We begin by proving the existence of another column of on the left of the 
left Since the right column of collides with Z - the l 2 signal creating the left 

there is necessarily a # in the past of this say at time t\. But then, this # 
is necessarily the one who emitted Z, for else it would have been formed by I2/T2 
signals colliding with Z, thus forming another between the # and the of our 
hypothesis. This shows that Z was originated by a # in the same column than the 
right 

But then, this # must have emitted also an L\ signal, that cannot collide with 
the T 2 signal (let us call it Z'), colliding with Z to form our left Thus, this signal 
must be converted into an li before meeting the r 2 of the hypothesis; more precisely, 
it must collide with an Ri signal. But this one necessarily has a finite origin, since 
it could not have crossed Z'. This shows the existence of another column of at 
t 2 - 
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% y 



# (,) # #' 




But then, since both ^s at t\ and t 2 emit a Li or Ri signal, and they are desynchro- 
nized, one of this signal has to hit the other column before to- m fact, to avoid this, 
we should have at the same time x + y > 2y and x + y > 2x, which is impossible. 
Thus, we can apply Proposition 14.31 or its symmetric to this step (at time t on the 
figure) . ■ 



Appendix E. Proof of Lemma 14.5 

Proof. We first need to show that in a configuration x G X$ containing #s, there is 
no history diagram yielding x in which two consecutive #s of x were both created 
by r 2 /l 2 signals, or both by r\jl\ signals. 

• if two consecutive #s were created by r 2 /l2 signals, these signals would have 
collided before, thus yielding another # between them. 

• if two consecutive #s were created by ri/Zi, the right r\ would originate in 
the left column, and vice-versa: 




These signals would then have been created by #s, which also would have 
emitted r 2 s and Z2S intersecting between the two #s in question, thus creating 
another 

Now if we have two consecutive ^s in a configuration, we can infer the position 
of a third one from the signals having created them, since they have different speeds 
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and thus must intersect. Then, we can reproduce this argument at the step where 
these signals were created. 

The only problem with this is that the infered # could be on the same side at 
each step of the argument, thus yielding a configuration with a rightmost # or a 
leftmost Without loss of generality, let us assume there is rightmost Then, 
the past of this cell would be an infinite column of #s/#'s, with infinitely many ^s. 
But then, some r2 signal emitted by a # of this column would collide with an Ri 
emitted later, which would yield a K state: the configuration we considered would 
not be in Xg. 

Then, it is easy to see that there can be no signal between two consecutive #s, 
for they would come from a desynchronized configuration in a history diagram of 
the configuration. ■ 



Appendix F. Proof of Lemma 14.61 

Proof. Applying Lemma 14.51 at time t, we now that the configuration at time t is 
of the form w (#B n ) w . Moreover, using again the argumentation of Lemma 14. 5[ we 
know that their is some time t' before t in history where the configuration is of the 
form w (#B n ) w with n' > n. Hence, L is exactly the language of forward orbits of 
periodic configurations of the form w (^B n ) w . It is straightforward to check from the 
definition of S (see Figure [I]) that the language of periods of such forward orbits is 
a finite union of languages of the form: 

#'B Xl Ai - ■ ■ B Xi Ai 

where i < 4, G X and where the x« must satisfy some simple linear equations. 
The lemma follows. ■ 



Appendix G. Proof of Lemma \4V7_ 

Proof. There are several possibilities for the number of # in the past of the # of 
There may be at most two, for else, two of them would have been created by li 
signals crossing the R\/ri emitted by the third and that intersection, circled in 
the following picture, can occur only once. 




Thus, we are left with: 
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• Two # in the past: in this case, the r 2 they emitted allows to guess the 
whole column. After the last r 2 signal, there may be signals in the opposite 
direction, that cannot collide in the past, i.e. first l 2 s, then Lis and lis. Such 
configurations are described, with x = 2z + y, by: 

{R u n, B} u {r 2l BYl x B x l 2 BVL x B z l 2 B z {#\ #}B z r 2 B z ■R 1 BVr 2 B x r 1 {L 1 , h, BY 

• With only one # in the past, there may still be some l\ signal somewhere: 

#' 



# 



Thus the configurations are of the following form: 

{R u n, B¥L x B*l 2 B z {#', #)B z r 2 B z R x {L x , h, BY , 

with possibly a r\ replacing some state on the left, and an l\ on the right, 
like on the figure. 

• With no # in the past, we have an infinite column of thus no signal can 
come from it. These configurations are of the form: 

U {R 1} n, B}{r 2 , 5}*#'{/ 2 , B}* {1^, h, BY 

The lemma easily follows from the characterization of the different possible forms 
of configuration discussed above. ■ 

Appendix H. Proof of Lemma 14.81 

Proof. In a history diagram satisfying the hypothesis their can be no collision that 
would generate a # and even no signal crossing (because this would mean that two 
signal are going in opposite directions in the same part of a configuration and thus 
that at least one would meet a in the past, which is forbidden by the hypothesis). 
Therefore we have: L = (B+{R U n})* (B + r 2 )* (#+#')* {B + l 2 )* {B+{L U h})* . m 

Appendix I. Proof of Lemma 14.91 

Proof. Without any nor we are left with blank states and signals. The con- 
figurations of the limit set are those configurations in which the possible signals do 
not have to collide in the past, thus fall in one of these three possibilities: 

u {luri,B} a 

"{ruh,BY 
> 1; R u B}{r 2 , B}*{1 2 , B}*{h, L u BY 




